Nonparametric Canonical Correlation Analysis

نویسندگان

  • Tomer Michaeli
  • Weiran Wang
  • Karen Livescu
چکیده

Canonical correlation analysis (CCA) is a classical representation learning technique for finding correlated variables in multi-view data. Several nonlinear extensions of the original linear CCA have been proposed, including kernel and deep neural network methods. These approaches seek maximally correlated projections among families of functions, which the user specifies (by choosing a kernel or neural network structure), and are computationally demanding. Interestingly, the theory of nonlinear CCA, without functional restrictions, had been studied in the population setting by Lancaster already in the 1950s, but these results have not inspired practical algorithms. We revisit Lancaster’s theory to devise a practical algorithm for nonparametric CCA (NCCA). Specifically, we show that the solution can be expressed in terms of the singular value decomposition of a certain operator associated with the joint density of the views. Thus, by estimating the population density from data, NCCA reduces to solving an eigenvalue system, superficially like kernel CCA but, importantly, without requiring the inversion of any kernel matrix. We also derive a partially linear CCA (PLCCA) variant in which one of the views undergoes a linear projection while the other is nonparametric. Using a kernel density estimate based on a small number of nearest neighbors, our NCCA and PLCCA algorithms are memory-efficient, often run much faster, and perform better than kernel CCA and comparable to deep CCA.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Canonical Analysis of the Relationship between Components of Professional Ethics and Dimensions of ‎Social Responsibility‌ ‌

  Background: Today, professional ethics and social responsibility play an important role in ‎organizations. This study aimed canonical analysis of the relationship between components ‎of professional ethics and social responsibility dimensions among the first high ‎school teachers in the Naghadeh province.‎‏ ‏ Method: This study, in terms of purpose is application, and in terms of data ‎collec...

متن کامل

A Gaussian process latent variable model formulation of canonical correlation analysis

We investigate a nonparametric model with which to visualize the relationship between two datasets. We base our model on Gaussian Process Latent Variable Models (GPLVM)[1],[2], a probabilistically defined latent variable model which takes the alternative approach of marginalizing the parameters and optimizing the latent variables; we optimize a latent variable set for each dataset, which preser...

متن کامل

The canonical correlation between Contingencies self-worth and adjustment of students

This research aimed at studying the canonical correlation between Contingencies self-worth (Family support, Competition, Appearance, God’s love, Academic competence, Virtue, Approval from others) with adjustment (emotional, Social and academic). In order of this research, 221 university students were selected by random ratio sampling method (1272 cases). Data was gathered through contingencies ...

متن کامل

Canonical Correlation Analysis for Determination of Relationship between Morphological and Physiological Pollinated Characteristics in Five Varieties of Phalaenopsis

Phalaenopsis is an important genus of orchids that is grown for economical production of cut flower and potted plants. The objective of this study is the evaluation of correlation between morphological and physiological traits of self and cross-pollination of 5 varieties of Phalaenopsis orchid. Some morphological traits were measured: Capsule length (CL), capsule volume (CV), weight of seeds in...

متن کامل

Deep Canonical Correlation Analysis

We introduce Deep Canonical Correlation Analysis (DCCA), a method to learn complex nonlinear transformations of two views of data such that the resulting representations are highly linearly correlated. Parameters of both transformations are jointly learned to maximize the (regularized) total correlation. It can be viewed as a nonlinear extension of the linear method canonical correlation analys...

متن کامل

Identification of Risk Factors by Using Macroeconomic and Firm-Specific Variables Simultaneously in Tehran Stock Exchange by Applying Canonical Correlation Analysis

The main objective of this study is to give the insight of describing mixing accounting ratios and macroeconomic variables as the risk factors in Iran. The results indicate a significant relationship between book to market ratio, financial leverage, size factors and expected stock returns in the Iranian market. In consistent with the other studies, we came to the conclusion that the term struct...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016